FDMSM robust signal representation for speech mixtures and noise corrupted audio signals
نویسندگان
چکیده
The fixed dimension modified sinusoidal model (FDMSM) was recently proposed as an attractive candidate for compact representation of audio signals in adverse conditions. This paper aims to study the capability of the FDMSM signal representation for analysis and synthesis of speech mixtures as well as noisy audio signals corrupted by highly colored noise of babble and harmonic. Extensive simulation results verified that the FDMSM provides high perceptual quality of the synthesized output signal compared with the conventional harmonic plus noise model (HNM) for both speech mixtures as well as audio signals corrupted by various types of noise.
منابع مشابه
روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملA New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملRobust Time-Varying Filtering of Speech Signals Corrupted by Mixed Gaussian and Impulse Noise
A robust time-varying filtering procedure for speech signals corrupted by mixed Gaussian and impulse noise is presented. It is based on the robust time-frequency distributions that can provide efficient representation of the noisy speech signals. The proposed approach has been compared with the time-varying filtering procedure based on the standard time-frequency distributions.
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEICE Electronic Express
دوره 6 شماره
صفحات -
تاریخ انتشار 2009